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INTRODUCTION 

The present investigation is essentially an attempt to devise 
a statistical method for treating learning data. Part I is a dis- 
cussion of correlation methods and empirical and rational equa- 
tions. Part II is a description of the learning curve equation 
and its interpretation. Part III is a discussion of the application 
of the learning curve equation to typewriter learning. Part IV 
is a summary. 

Learning curves are usually very erratic and for this reason 
it is necessary to study the general trend of numerous observa- 
tions instead of the variable individual observations. The 
methods to be discussed often make it possible to obtain co- 
efficients which express the characteristics of a subject's learn- 
ing based On all the observations and in such a manner that all 
observations are as far as possible equally weighted. Quantita- 
tive methods in psychology are far in advance of our control 
over the things measured, and consequently we make ourselves 
subject to ridicule when refined correlation statistics are applied 
to measures which are obviously crude. We shall therefore dis- 
cuss not only the more refined statistical procedure for the learn- 
ing curve but also some readily applied methods which are 
adaptable in the study of more or less erratic learning data. 
Even though refined technique is available we should select the 
quantitative methods for any particular study so as to keep a 
fair balance between the certainty of our measures and the statis- 
tical niceties by which we treat them. 

For the benefit of any reader who wishes to apply the statis- 
tical methods to be described for his own learning data I wish 
to call attention to the first two sections of the summary in which 
will be found an outline describing the detailed procedure in 
calculating the learning coefficients. 

In applying these methods of learning curve analysis one 
should be fully aware of their limitations. They are not ap- 
plicable to the following conditions of learning : i ) when trial 
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and error learning is mixed with generalizations such as in puzzle 
solving; 2) when the learning is so erratic that it fails to show 
continuity; 3) when the learning process has not been carried 
far enough to reveal the nature of the function; which is often 
the case with apparently linear learning curves; 4) when the 
learning curve is not plotted in the speed-amount form; 5) when 
the learning curve fails to show diminishing returns with prac- 
tice; 6) when the units of formal practice are variable in the 
different stages of learning (learning measured on different 
successive scales can not be treated as a continuous function) ; 
7) when the wrong responses are eliminated by ideational learn- 
ing without giving any objective scores during the process of 
elimination. Such learning curves have the same appearance 
as those which contain generalizations. 

1 ) Verbal Statement of Relationship 

Our present problem concerns the relationship between prac- 
tice and attainment in learning. When an observer notes as an 
element in common experience that attainment increases as prac- 
tice increases, he may generalize by verbally asserting a positive 
relation between the two variables. The verbal generalization 
is so common that it is embodied in what we call common sense. 
Thus we expect without further verification that twenty hours 
of practice in a complex function will yield higher attainment 
than ten hours of practice under roughly similar conditions, but 
uncontrolled observation does not tell us how much higher. 

2) The Correlation Coefficient as an Expression of Re- 

lationship 
It is possible to express by a single number the degree of rela- 
tionship between two variables. This is what one attempts to 
do by a correlation coefficient. The Pears6n coefficient of corre- 
lation is so derived that when its value is unity the two variables 
have perfect concomitance. When its value is — 1. the two 
variables have perfect inverse relationship, a rise in one of the 
variables being always associated with a proportional decrease 
in the other. A zero correlation establishes the fact of entire 
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absence of relationship within the conditions of the experiment. 

A correlation coefficient considerably less than unity may be 
explained in at least four different ways: i) the observations 
themselves may be so inaccurate as to obscure the relationship; 
2) the two variables may be related through a common third 
variable which, if not controlled or kept constant, plays havoc 
with the experiment; 3) the regression may be non-linear in 
which case the Pearson coefficient, r, is almost meaningless;* 
4) the two variables may be intrinsically independent. Psycho- 
logical experimentation rarely yields correlations over 0.85 be- 
cause of the inaccuracy of psychological measures. When a 
correlation coefficient turns out to be 0.95 or above an empirical 
equation may properly be substituted for the correlation methods. 

In the interpretation of a correlation coefficient one should be 
careful to note that while a high correlation coefficient does indi- 
cate a relation between the variables under the conditions of the 
experiment, a low coefficient does not indicate the absence of 
relationship between the variables. The first, second, or third 
factors enumerated in the preceding paragraph may be responsi- 
ble for a low coefficient when a high relation really exists. It is 
perhaps rare that a correlation is calculated with psychological 
data which is not grossly affected by all three of these factors. 

When one variable is immediately contingent on one or more 
other variables it is advisable to use the method of partial corre- 
lation to establish the relation. Thus the volume of a box is 
contingent on the three variables, length, width, and depth. Now, 
if a heterogeneous collection of wooden boxes were to be meas- 
ured as to all four of these attributes and the correlation co- 
efficient between volume and length determined, it would un- 
doubtedly turn out to be positive and significant because long 
boxes are usually more voluminous than short boxes. But the 

♦The term regression was introduced by Galton in connection with his 
statistical studies in the heredity of stature. It is the equation of the best 
fitting line for a series of paired observations of two variables. The use 
of the term seems to be restricted to data of considerable dispersion and is 
not used for those observations in the exact sciences which do not involve 
serious scatter. The term linear regression refers to the equation of a re- 
gression line which is straight, as contrasted with non-linear regressions 
which are curved. See Yule, 'Textbook in Theory of Statistics," p. 176 
and references 2 and 3 on p. 188. 
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coefficient would not be unity because of the two other variables 
which were left out of consideration. This would illustrate 
case 2 in the preceding paragraph. We may distinguish two 
methods of handling this type of relation, a) We may control 
the extraneous variables by keeping them constant while meas- 
uring the two variables with which we are immediately con- 
cerned. This is the customary procedure in the physical sciences. 
Thus when verifying Boyle's Law we keep the temperature of 
the gas constant, but when verifying Charles 9 Law we keep the 
pressure constant. Except for these precautions neither of the 
two laws would be observed. In the biological sciences we do 
not have such ready control over the extraneous variables and 
the best we can do is to measure them also, while allowing them 
to vary at will, b) In these cases we use a second method, 
namely, partial correlation. Thus if we calculate the partial 
correlation coefficient between volume and length of the collec- 
tion of wooden boxes, with the depth and width accounted for, 
we obtain a higher coefficient than when the two extraneous 
variables were ignored. The advantage of the partial correla- 
tion method is that it enables us to control the extraneous vari- 
ables analytically without having any physical control over them. 
There are some limitations of the correlation methods which 
every experimenter should keep in mind in order to guard 
against erroneous conclusions. One of the limitations of the 
partial correlation method is that it assumes the combined effect 
of the several independent variables on the dependent variable 
to be additive, a condition which rarely obtains. This limitation 
is much more serious than the inadequacy of the correlation 
coefficient for non-linear regressions. A non-linear regression 
can usually be rectified by one of the algebraic artifices used in 
connection with empirical equations, but the assumption that the 
several independent variables produce their effect on the de- 
pendent variable in an additive manner can not be handled by 
any predetermined statistical method. Thus, returning to the 
box illustration, the best measure we could obtain by the method 
of partial correlation is expressed in the form 

v = kjd + ^2 W + ^3!- l 
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But the true formula for the volume takes the form 

v = k.d.w.l. 2 

Now, this type of relation is not revealed by the partial cor* 
relation method, nor is the method adequate for any of the 
thousands of ways in which the several variables may combine 
except the additive one. 

If we are not content with merely stating in quantitative form 
the degree of relationship between the two variables but wish to 
formulate a method of prediction, we use the regression equa- 
tion. This equation is derived from the Pearson coefficient 
which merely states in numerical form the degree of relation- 
ship between the two variables. It places the information at 
our immediate command for the purpose of prediction. Given 
one of the unknowns, the other can be found either from the 
regression lines or from the regression equation which is simply 
an algebraic description of the regression line. 

It is quite conceivable that a low value of attribute A may 
be associated with either high or low value of attribute B, 
whereas a high value of attribute A may be associated with 
only high values of attribute B. Similarly a low value of at- 
tribute B may be associated with low values of attribute A only, 
whereas high values of B are associated with either high or 
low values of attribute A. Whenever such conditions obtain 
the Pearson correlation coefficient is inadequate to express the 
complete relationship. In these cases, called non-linear regres- 
sions, it is advisable to calculate another kind of coefficient which 
is called the eta coefficient, %* or correlation ratio. The sig- 
nificance of the correlation ratio may perhaps be made more 
apparent by the analogue All dogs care quadrupeds but all quad- 
rufieds are not dogs. What would correspond to the* correlation 
ratio of dogs on quadrupeds would be very high because all 
dogs are quadrupeds. But the correlation ratio of quadrupeds 
on dogs would be low, for only a few of the quadrupeds are 
dogs. The Pearson correlation coefficient for a relation of this 
type would be positive but low. 

♦For a brief statement giving the derivation of the correlation ratio, see 
Yule, p. 204. 
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There are a number of algebraic artifices by means of which 
a non-linear regression may be rectified. The value of such 
devices becomes apparent when it is considered that such other- 
wise exceedingly useful tools as are available in the correlation 
methods are inapplicable as long as the regressions are non- 
linear. The investigator must rely on his own ingenuity in rec- 
tifying a non-linear regression. Some of these methods will be 
considered in connection with the learning curve equation. . 

3) Empirical Equations 

Every equation can be represented by a line in a diagram and 
practically every line encountered in quantitative experimental 
work can be represented by an equation. Thus the regression 
equation is only an algebraic way of describing the regression 
line of a scatter diagram, or, putting it the other way, the re- 
gression line is a graphical description of the regression equa- 
tion. Each tells the same story in its respective language. 

An empirical equation is an equation selected to fit a given 
set of data. The observations give us the diagram and if we 
find an equation whose line coincides with the general trend of 
the observations, it may be used interchangeably with the dia- 
gram for predicting one of the attributes when the other is 
given. When the observations indicate a linear relation we can 
derive the corresponding equation with very little trouble, but 
when the observations fall along a curve and when they are 
badly scattered the finding of the most representative empirical 
equation sometimes taxes the investigator's ingenuity. I shall 
describe the routine steps in determining the empirical equation 
for a linear relation by means of an example and will show that 
it turns out to be identical with the regression equation for the 
same data. 

Figure 1 is a diagram of the relation between two hypo- 
thetical variables X and Y. Each of the small circles repre- 
sents a hypothetical observation; the solid line represents the 
general trend of the observations. This line may be used for 
the purpose of prediction. Thus if we know that on a certain 
occasion attribute X had a numerical value of 9, the attribute Y 
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must have been very close to the value 63, as read from the 
chart. Our problem now is to describe this line algebraically 
so that the prediction may be made by means of a formula in- 
stead of by the diagram. 

a) Method of inspection. This procedure is the simplest but 
it can only be applied when the relation is close, as it is in the 
present illustration. We first indicate the observations by small 
circles or dots on the diagram. Then we draw by inspection the 
best fitting straight line through the general trend of the ob- 
servations. The equation of a straight line always takes the form 

Y = a + b.X 3 

The y-intercept is 30.8 and it is the constant a. The slope of 
the line is 3.56 and it is the constant b. Hence the equation for 
the line is 

Y = 30.8 + 3.56X 4 

This equation may be used interchangeably with the diagram 
in predicting one of the attributes when the other is known. 
The procedure is so simple and direct that it would be uni- 
versally used, were it not for the fact that when the observa- 
tions scatter badly, it is difficult to draw the best fitting straight 
line by inspection. Moreover, the same line can not properly 
be used in predicting X from Y as in predicting Y from X 
when the data are scattered. In these cases we have recourse 
to two other types of procedure, namely, the method of the re- 
gression equation and the method of least squares. 

b) Method of regression equation. In figure 2 we have repre- 
sented a hypothetical set of data which are quite scattered. By 
the usual correlation methods we obtain the following constants : 

r =+ 0.59 
*x = 4-5 8 

°j = 303 
n = 50. 

m x = 10. 

m y = 8. 

The regression equation with two variables for predicting X 

from Y takes the form 
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°* 



x = r„ y — y 



°i 



in which x and y are deviations of X and Y from their respective 
means. Rewriting this equation in terms of the variables X and 
Y instead of in terms of the deviations from their means, we have 

X — mx = r — (Y — m y ) 6 



°7 



in which m x and m r are the arithmetic means of X and Y re- 
spectively. 

Substituting the numerical values into equation 6 we have 

v . 0.59X4.58 ox 

X — 10 = (Y — 8) 7 

303 

and simplifying, we obtain 

X = 0.89Y + 2.88 8 

This is the regression equation ready for use. By means of 

it we predict X when the value of Y is known. When the data 

are not seriously scattered it is safe to fit the line by inspection 

and determine the empirical equation by the shorter method, 

but with the data of figure 2 the regression lines can hardly be 

judged by inspection. 

By analogy the regression equation for predicting Y from X 

takes the form 



°i 



y = r ir — x 



*x 



which when stated in terms of the variables instead of in terms 
of the deviations from their respective means becomes 

Y — 1^ = ^— (X — m x ) .10 



°x 



Substituting and simplifying, as before, we get as a numerical 
statement of the relation that 

Y = 0.39X + 4.i n 

which is ready for use in predicting the most probable value of 
Y for a known value of X. 

These regression equations may be obtained with less arith- 
metical labor, particularly if only one of the regression equations 
is needed, by the method of least squares. 
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c) Method of least squares. The method of least squares is 
an aid in finding the best fitting straight lines for representing 
a series of observations. It can be applied also to curves but 
that often leads to awkward mathematical maneuvering. The 
method of least squares gives a line such that the sum of the 
squares of the deviations of the independent variable from the 
regression line is a minimum. It is the best fitting straight line 
for the observations. The method gives the numerical values 
of the constants a and b in the equation 

X = a + b.Y 3 

which represents any straight line and by means of which we 
can predict the most probable value of X from a given value 
of Y. The application of the method consists in solving the two 
following formulae: 

S(Y).S(X.Y) — S(Y*).S(X) 



P(Y)]» — n.S(Y») 



12 



_ S(Y)S(X)— n*(XY) 

_ [2(Y)] 2 — n.S(Y a ) I3 

Substituting the appropriate sums from the data we find that 
a = 2.9 and b = o.89. Hence the equation of the best fitting 
line for predicting X from Y is 

X = 2.90 + 0.89Y 14 

It should be noted that this equation, as determined by the 
method of least squares, is identical with the regression equation 
8 which was determined by correlation methods. 

By analogy, the equation for the straight line by which the 
most probable value of Y may be determined from a known 
valu$ of X is 

Y = c + d-X 15 

The constants c and d may be determined from the original ob- 
servations by the formulae: 

^S(X).S(X.Y)— 3(X 2 )S(Y) 

C [2(X)] 2 — n-2(X 2 ) IS 

S(X).*(Y)-n.S(X.Y) 

d== . ,16 

[S(X)] 2 — n-2(X 2 ) 
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Substituting the numerical value of the data from figure 2 into 
equations 15 and 16 we obtain: = 4.09, and b==o.39. Sub- 
stituting these numerical values into 15 we get 

Y = 4.09 + 0.39X 17 

by means of which we may predict the most probable value of 
Y from a known value of X. 

It can be shown readily that when the regression equations 
have been determined by the method of least squares the Pearson 
coefficient of correlation is expressed by the relation 

r = V bd * 18 

in which b and d are identical with the regression coefficients. 

The above calculations make it apparent that the method of 
least squares gives us a pair of regression lines which are iden- 
tical with those obtained by the correlation methods. For the 
purpose of stating quantitatively the degree of relationship be- 
tween two variables it is desirable to calculate the correlation 
coefficient. When the regression equation is of primary interest 
it can be calculated to advantage by the method of least squares 
particularly if only one of the regressions is needed. 
4) Rational Equations 

A rational equation is derived from known relations and is 
verified by experimental observation. From a philosophic point 
of view one may argue that all equations used by science are in 
the last analysis empirical but in practice there is a far cry be- 
tween fitting an empirical equation to a series of observations 
and the ability to predict the observed relations on the basis of 
a rational equation. 

Psychology has very few bona fide laws on which we can 
build a system of quantitative prediction and control. Thus 
practically all we can do with the problem of learning is to ob- 
serve the function and describe it. The present attempt is to de- 
scribe it quantitatively by an empirical equation. Some day we 
shall possess in psychology a coordinated system of really work- 
able concepts with objective reference by which we may be able 
to predict and control at least certain aspects of behavior by 
rational .equations or their equivalents. 

♦See Yule, p. 203. 



II. THE LEARNING CURVE EQUATION 

i ) Purpose of the Equation 

When the learning function for a simple coordination pro- 
ceeds undisturbed by external or internal distraction it usually 
follows a law of diminishing returns. In the majority of learn- 
ing curves the amount of attainment gained per unit of practice 
decreases as practice increases. Exceptions to this tendency are 
found in studying the learning of complex processes such as a 
foreign language, and when successive generalizations are in- 
volved such as puzzle solving and the like. These exceptions 
sometimes take the form of a positive acceleration at the initial 
stage of the learning, plateaus during the course of learning 
and erratic advance of attainment. But these irregularities 
should ntft stand in the way of an attempt to express the learn- 
ing f unction as a law provided that we do it with due conserva- 
tism in its interpretation. All we can hope to do in thus ex- 
pressing the learning function is to formulate what can with 
considerable certainty be considered as the typical relation be- 
tween practice and attainment. 

Besides giving the satisfaction of formulating the relation 
between practice and attainment, the use of an equation for this 
relation enables one to predict the limit of practice before it 
has been attained, provided that the learning follows the law 
of diminishing returns. It also enables one to differentiate for 
various purposes the rate of learning from the limit of practice 
since these two attributes are undoubtedly independent. It en- 
ables us to state how much preceding practice the subject has 
experienced under the assumption that the learning function 
followed the same law before and after the formal measure- 
ments. Another use for which the equation can be of service 
is in the analysis of the relation between the variability in learn- 
ing and other mental attributes. The problems of formal disci- 
pline may be investigated by ascertaining whether a succession 
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of learning processes, all of the same type, yields any rise in 
the limit of practice, or a higher rate of learning, or a greater 
consistency of learning in the successive learning processes. 
Some of these coefficients may be more susceptible than others 
to modification by successive repetition of the same type of learn- 
ing. This would in reality be studying the problems of learning 
how to learn. All questions of transfer of training may be in- 
vestigated by the learning equation and the transfer effect may 
be differentiated into its psychological components. Thus, con- 
tinued practice in learning poetry may show no rise of the prac- 
tice limit, but a considerable rise in the rate at which that limit 
is approached and in a decrease of the variability of the learning. 
Relearning may be found to approach the same limit of practice 
as the initial learning but it may proceed at a higher rate, and 
this rate can be stated as a coefficient which is independent oi 
the amount of previous practice in each learning process. The 
laws of forgetting are expressible in terms quite similar to those 
here used for the learning function. It is not at all unlikely that 
these coefficients may come to be significant in individual psy- 
chology quite apart from their immediate utility as descriptive 
attributes of the learning function. The preceding remarks 
have, I hope, justified my attempt to devise a method for in- 
vestigating the learning, memory, and forgetting functions. 
2) The Equation 

After experimenting with some forty different equations on 
published learning curves I have selected a form of the hyperbola., 
as being for practical purposes the most available. It takes the 5 
form 

LX 
Y = 19 

X + R 
in which 
Y = attainment in terms of the number of successful acts per 

unit time. 
X = formal practice in terms of the total number of practice 

acts since the beginning of formal practice. 
L = Limit of practice in terms of attainment units. 
R = Rate of learning which indicates the relative rapidity 
with which the limit of practice is being approached. It 
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is numerically high for a low rate of approach and nu- 
merically low for a high rate of approach. 

Equation 19 represents a learning curve which passes through 

the origin, i.e., it starts with a zero score at zero formal practice. 

The majority of learning curves start with some finite score 

even at the initial performance. For learning curves which do 

not pass through the origin, the equation becomes 

Y- L(X+P) 
(X+P)+R 

in which P = equivalent previous practice in terms of formal 

practice units. 

Figure 3 represents the learning curve for subject No. 23 in 
the group of fifty-one typewriter students to be discussed in a 
later section. This curve is plotted between attainment, Y, in 
terms of the number of words written in a four minute test 
given weekly for seven months, and formal practice (X) in 
terms of the total number of pages written since entering the 
course. We shall call this type of curve the speed-amount curve 
to distinguish it from other ways of plotting the same data. 

Equation 19 may be rectified as follows: 

X + R 
XY + RY = LX 

X + R=*L(^) 21 

This equation is linear if X/Y is plotted against X. Similarly 
equation 21 may be rectified when written in the form 

X+(R + P)=L (X +- P) 22 

which becomes linear when (X + P)/Y is plotted against X. 

When so rectified, the constants L, R, and P may be deter- 
mined by several different methods, the choice between which 
depends on the scatter of the data, the desired accuracy, and the 
number of curves one has to calculate. We shall describe four 
methods of calculating the coefficients. 
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a) Method of least squares 

Case i: when the learning curve passes through the origin: 
Arrange the data as in table i. Calculate X/Y and tabulate. 
Plot X/Y against X as in figure 4. For convenience we shall call 

x 7 

- = Z 23 

The reader will notice that the learning data as plotted in figure 
4 falls practically in a straight line whereas the same data in 
figure 3 takes the typical learning curve form. 

The equation for the best fitting straight line of figure 4 can 
be represented by the equation 

Z = c + d-X 24 

The constants c and d are determined from the table of data by 
the formulae: 

_ S(X).S(XZ) — S(X*).2(Z) 

C= [S(X)]» — n-S(X») 25 

2(X).2(Z)— n.S(XZ) 

d = ■ 26 

[2(X)] 2 — n2(X 2 ) 

which are simply the least square formulae (15) and (16) re- 
written for X and Z. Substituting the proper sums, we have 
c = o.42, and d== 0.0041. Hence the equation for X and Z 
becomes 

Z = 0.42 + 0-0041 X 27 

which by replacing X/Y for Z and transposing becomes 

Y- 244X 28 

X+102 

in which the predicted limit of practice, L, is 244 words in four 
minutes, and the rate of learning, R, is 102. The constants L 
and R may also be determined by the relations 

R = c/d 

Z= i/d 

Plotting equation (28) we obtain the solid line in figure 3. 

It will be noticed that this curve fits quite well the general trend 

of the observations which are indicated by the small circles. This 

method of stating algebraically the relation between practice and 
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attainment is of course not applicable unless the speed-amount 
curve for the data takes the typical hyperbolic form. 

Case 2: when the learning curve does not pass through the 
origin. When the learning curve does not pass through the ori- 
gin it can be rectified by slightly different procedure. We shall 
take as an illustration the combined curve for a group of fifty- 
one subjects studying typewriting. Figure 5 represents the 
average speed of typewriting against the total number of pages 
written since entering the course. It is seen to be a fairly smooth 
and regular curve. 

The equation for the learning curve which does not pass 
through the origin is 

y- L(X+P) 

(X + P)+R 
or, if we call 

P + R = K 32 

for convenience, we have, instead of equation (20) 

L(X + P) 

X + K M 

This equation can be rectified as follows : When X = o, and 
Y = Y lt Yj being the initial attainment score, 

LP 
Y X = _ 34 

and hence equation (33) becomes 

XY T X 

= L K 35 

Y — Y t Y — Yi 

This equation is linear if XY/(Y — Y t ) is plotted against 

X/(Y — Yi), in which case L is the multiplying constant and 

K is the additive constant. 

Plotting the data represented in figure 5 in this manner we 

obtain figure 6 in which the learning data appear as a straight 

line. This line may be represented by the equation 

S = a + b.T 36 

in which 

X A XY 
T = and S = 

Y — Y t Y — Y A 
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The numerical values of a and b may be determined by the fol- 
lowing least square formulae which are identical with equations 
12 and 13, except for the analogous notation. 

S(T)S(ST)— 5(T 2 )S(S) 



[S(T)] 2 — n*(T a ) 
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5(T).S(S)-nS(S.T) 

b = . 38 

[S(T)] 2 — nS(T 2 ) 

Substituting the proper sums we find that a= — 148. and 
b = 2i6. Hence 

S = 148. + 216.T 39 

which is the equation of the solid line in figure 6. This equation 
may be transposed into the original form of equation 20, or we 
may write it in that form directly by the following relations : 

a=K 

b = L 

P= -£*L 

L 

R=K — P 
All of the constants K, L, Y, P, and R, are positive when ap- 
plied to learning curves. It should be noted that Y x is a repre- 
sentative original score determined by projecting the learning 
curve back to the y-axis. In figure 5 the actually observed 
initial score was used since it is continuous with the rest of the 
data. But it is occasionally necessary to select a representative 
initial score since Y x is weighted in this procedure more than 
any of the other points. The numerical values of these con- 
stants for the data of figure 5 are as follows : 

L = 2i6. 

P= 19. 

R = 133. 
Substituting these constants in equation 55 we have 

2i6.(X + i9.) 

Y = 40 

X + 148. 

which when plotted becomes the solid line of figure 5. The 

reader will notice that this equation, as represented by the solid 
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line in figure 5, is a beautiful fit for the data, and it justifies 
our use of equations ip and 20 to represent the hyperbolic form 
of learning curve. 

The predicted limit of practice L which is 216. words in four 
minutes, is of course based on the assumption that the learning 
curve would continue as uniformly beyond the measurements 
as it did during the measurements. This limitation must be kept 
in mind and we shall therefore differentiate between the pre- 
dicted limit and a limit of practice which has been practically 
attained. The equivalent previous practice (P) is ip pages 
which we may interpret as the average number of pages of type- 
writing to which. the previous general experience of our subjects 
was equivalent. This interpretation of the constant P is also 
limited by the assumption that the unmeasured learning function 
followed the law which the measurements reveal. One circum- 
stance which bears out this assumption is that those learning 
curves which actually do pass through the origin and which do 
not show positive acceleration usually follow this curve law 
when the coordinates are properly chosen. The curve of figure 
5 does not pass through the origin but this is explainable by the 
fact that a person who has never touched a typewriter will in 
four minutes make some finite score even though handicapped 
by using the hunt and punch method. 
b) Method of inspection 

When the observations fall very nearly in a straight line as 
they do in figure 6 it is hardly necessary to plough through the 
arithmetical labor involved in evaluating the constants a and b 
of equation 36 by the method of least squares unless one has 
ready access to a calculating machine. After plotting figure 6 
one may draw at sight the best fitting straight line through the 
general trend of the data and evaluate the constants from any 
of the following relations: 

y-intercept = a = K 

x-intercept = K/Y x 

slope = L 

P = - 

L 

R = K — P 
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By this graphical procedure much labor is saved in calculating 
the learning curve constants and the method is identical with 
the preceding in principle, 
c) Method of three equidistant points 

The learning coefficients may be determined from three se- 
lected points with less labor than when all the observations are 
taken into account. These three points should be so selected that 
they represent the general trend of the learning curve. 

Let the three selected points be denoted Xj^; X 2 Y 2 ; and 

X 8 Y 8 . Let Xj be zero, X 8 the total amount of practice and X 2 

the midpoint between X 1 and X 8 . Let the Y-values be the most 

representative ordinates to the curve. Then 

X 8 = 2-X 2 4* 

By substituting these values into equation 20, transposing and 

simplifying, we obtain 

T _ X 8 (Y 2 — X 8 ) 

K = — 42 

Y 8 + Y 1 -2Y 2 ^ 

Y 8 (X 8 + K)^-Y 1 .K 
L = 7Z 43 



X 



3 



R = K — P 45 

From these relations we may determine the numerical values of 
the learning coefficients in terms of the three equidistant points. 
When the curve passes through the origin both X x and Y x 
are zero. The coefficients may then be determined by the fol- 
lowing somewhat simpler relations : 

X 8 (Y 2 -Y 8 ) 

K = 46 

Y 3 — 2-Y 2 

Y 8 (X, + K) 

L = 47 

X 8 

R = K 

P is zero because when the initial score is zero the equivalent 

previous practice is zero. 



THE LEARNING CURVE EQUATION 19 

3) Interpretation of Learning Constants 

We have seen that the learning curve equation 20 fits very 
well the learning data to which we have applied it. In order 
to bring out the interpretation of the learning coefficients we 
shall compare several learning curves with high and low nu- 
merical values of the coefficients. 

In figure 7 we have two hypothetical learning curves with dif- 
ferent physiological limits but with identical rates of approach. 
Figure 8 represents two hypothetical learning curves, both ap- 
proaching the same limit of practice, one at a high rate and the 
other at a low rate. Figure 9 represents two hypothetical learn- 
ing curves with same limit of practice, and with the same rate 
of approach, but differing in the amount of previous practice. 
Curve A represents forty units of previous practice while curve 
B represents no previous practice. The two curves are identical 
in shape, the only difference between them being that curve B 
is forty x-units to the right of curve A. The same interpreta- 
tion would be reached if the two curves were superimposed and 
the formal practice measurements started at the origin for curve 
B and after forty practice units for A. 

4) The Coordinates for Learning Curves 

So far we have considered learning curves plotted only be- 
tween the coordinates X (total number of practice acts since 
the beginning of practice) and Y (the number of successful acts 
per unit time). Learning curves have, however, been plotted 
with various units for the coordinates and we shall consider 
several of these together with some inferences that may be drawn 
from the translation of learning data from one system of units 
to another. 

The speed-amount curve is the name we shall use to designate 
the form of learning curve we have been considering. It is 
plotted as speed, Y, against amount of practice, X. It may be 
represented by equations 19 and 20 when it reveals the typical 
hyperbolic form. 

The time-amount curve is plotted as time, t, per unit amount 
of work against total amount of work, X, since the beginning 
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of practice. It is evident that the ordinates of this type of curve 
will be proportional to the reciprocals of the speed-amount curve 
for the same data. Hence we may define t as 

t = — 48 

Y 

where t is the time per unit amount of work and C is a constant 

Limiting ourselves to the curves of diminishing returns we have, 

as the equation of the time-amount curve 

C(X + K) 

t = 49 

L(X + P) 

The constant C is only significant in translating learning 
curves from one form to the other. Applying the equation di- 
rectly to learning data the constant C may be dropped. In that 
case 

X + R 

t = 50 

L(X + P) 

This equation may be rectified by the procedure previously out- 
lined for equations 19 and 20. When P = zero, we have 

X + R 

t = 5 1 

LX 

which can be rectified by plotting tX against X. 

In order to determine whether equations 50 and 51 really fit 
the time-amount curve throughout its range I have given a long 
substitution test to one of my students. He took the test seven- 
teen times, once a day, and reached what is for all practical 
purposes a practice limit. The time-amount curve for this learn- 
ing test is represented in figure 11. In figure 12 I have rectified 
the data by plotting the products tX against X. The reader 
will notice that the speed-amount curve is hyperbolic. It is quite 
gratifying that the learning records for an individual subject 
follow the hyperbolic law so closely. In order to avoid erratic 
scores from individual subjects it is absolutely essential that 
they work under uniform conditions with a minimum amount 
of distraction. The student whose substitution learning is repre- 
sented in figures 11 and 12 took the test once a day only and 
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always at 1 P. M. The test consisted in making six hundred 
substitutions at each sitting. 

The time-time curve is the learning curve plotted between the 
time, t, per unit amount of work and the total time, T, devoted 
to practice. An empirical equation may be derived for this type 
of curve from the assumed hyperbolic form of the speed-amount 
curve. The total time is the summation St-dx for the whole 
period of learning. Hence 

T=ftdx 52 

But from equation 50 

X + K 

t== 50 

L(X + P) 

and hence 

r X + K 

T _ r — — — dx 

1 — J L(X + P) * 

which may be written 

1 ,, K — P r dX 

T = — f dX + f 

L J L J X + P 

Integrating, we obtain 

T = - + -^— log(X + P)+C 1 53 

which gives the equivalent total time T in terms of X. Stating 

X explicitly from equation 50 and substituting in equation 53 

gives the desired relation between T and t as 

K — t-L-P K — P K — P 

T = —- - + — — log(— — )+C 2 54 

(Lt — i)L L L-t — 1 

While this equation does give us a relation between T and t as 
derived from the hyperbolic speed-amount curve it is too un- 
wieldy to be practically feasible. We are hardly justified in 
using so complex an empirical equation for learning data. 

The speed-time curve is plotted between the speed Y (number 
of successful acts per unit time) and the total amount of time, 
T, devoted to practice. An equation between Y and T may be 
derived by stating X explicitly from equation 20 in terms of Y 
and substituting this for X in equation S3 which gives 
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p_K K — P K — P 
T=^— r + -^— logY __log(L-Y)+C 2 55 

While this equation is too cumbersome for extensive use it 
serves one very interesting function in that it sheds light on the 
question of positive acceleration in learning curves. 

5) Initial Positive Acceleration in the Speed-Time Curve 

Equation 55 represents the speed-time curve. It may be sim- 
plified by letting 

A = P — KandB= K ~ P SSa 

when it becomes 

T=— ^— + BlogY — Blog(L — Y)+C 2 56 

The first derivative with respect to Y is 

dT A , B , B 

= — ■ 57 

dY (Y — L) 2 Y L — Y 

The second derivative is 

d 2 T = 2A B B 

dY 2~ (Y — L) 3 Y 2 (L — Y) 2 S 

which when simplified becomes 

d 2 T B.(L) 2 .(3.Y — L) 
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dY 2 Y 2 (L — Y) 8 

since A = — BL from equation 55a 
Equating the second derivative to zero, we have : 

B.(L) 2 .( 3 .Y — L) = 

Y 2 (L — Y) 8 _ ° 
which is true when Y has the value L/3. 

This demonstrates the presence of a point of inflection in 
equation 55 at the value L/3 for Y. The psychological signifi- 
cance of this relation may be stated as follows : 

The learning curve in the speed-time form must necessarily 
have an initial positive acceleration which changes to a negative 
acceleration when the attainment has reached one-third of the 
limit of practice. This conclusion is contingent on the assumpr 
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Hon that the learning curve in the speed-amount form is hyper- 
bolic, an assumption which has been empirically shown to be safe 
for the majority of learning curves. As has already been said, 
the speed-amount curve is usually hyperbolic but not always. 
These assertions regarding the speed-time curve are not applica- 
ble when the speed-amount curve for the same data is not hyper- 
bolic. The positive acceleration can not, of course, be obtained 
when the initial score is greater than one-third of the practice 
limit. It can only be observed when the initial score is less than 
one-third of the practice limit. 

In order to test empirically the above finding with regard to 
initial positive acceleration I have plotted in figure 13 the average 
typewriting speed for fifty-one subjects against weeks of prac- 
tice (the speed-time curve) instead of against total number of 
pages written (the speed-amount curve). The average practice 
limit for this group has already been found to be 216 words in 
four minutes according to the speed-amount curve for the same 
data. The reader will notice the initial positive acceleration fol- 
lowed by negative acceleration, and also that the point of transi- 
tion from positive to negative acceleration takes place at a 
writing speed of about seventy words in four minutes, as it 
should do according to our analysis of the speed-time curve. If 
this finding will stand the test of further experimentation it is 
obviously of considerable diagnostic value for the psychologist 
who can by means of it predict the practice limit when attain- 
ment reaches one-third of its limit. The limitation in the use 
of this relation is mainly in the erratic improvement in complex 
coordinations which are learned tinder variable conditions of 
distraction and in the occasional deviations from the typical 
hyperbolic form of the speed-amount curve. 

6) Other Possible Equations 

Before closing the discussion on the learning curve equation 
as such it might not be out of place to mention a few of the other 
equations which I have tried to use for learning data. These 
will not be of interest to the general reader but may be of in- 
terest to those who wish to try their hand at other empirical 
equations for the learning function. 
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One of these equations for the speed-amount curve is 

Y = L[i 1—] 60 

e~ 

where e is the Naperian base or some other constant This equa- 
tion can not readily be rectified except by trying successive values 
for L. When the proper value for L is found it can be rectified 
when written in the form 

log (L — Y) = log L — a-Xlog e 61 

by plotting log(L — Y) against X. If the curve does not pass 
through the origin equation 60 becomes 

Y = L[i -i- ] 62 

e »(x+p) 

which is rectified if the proper numerical values of L and P are 
found by writing it in the form 

log(L— Y) = log L— a(X+P)log e 63 

and plotting log(L — Y) against (X+P). This equation gives 
a fair approximation to the speed-amount curve but it does not 
fit nearly as well as the hyperbolic form previously considered. 
It can be rectified graphically by plotting Y-increments against X 
but this procedure is not feasible unless the individual observa- 
tions are more consistent than they usually are for learning data. 
The constants L and P can also be determined graphically from 
three selected points. If X t Y lf X 2 Y 2 , and X 8 Y 8 be three points 
on the curve, equidistant on the axis of abscissae, then the two 

• 

lines X 8 Y 8 ; X 2 Yi and X 8 Y 8 ; X a Y 8 will intersect in a point which 
is on the asymptote parallel to the axis of abscissae, thus deter- 
mining the constant L graphically. This equation gives a fair 
approximation to the speed-amount curve but it does not fit 
nearly as well as the hyperbolic form previously considered. 
Another equation which gives a fair approximation to the 

learning curve is 

c 

Y = L(B*) 64 

in which B and C are constants. It can be rectified by writing 
it in the form 

log Y = log L — — log B 65 
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and plotting logY against i/X. It has the advantage of sim- 
plicity and it can be used to represent an initial positive accelera- 
tion. But as far as I have been able to determine the constant L 
does not agree as well with observed values as the hyperbolic 
form. 

One could perhaps write an indefinite number of exponential, 
trigonometric and other functions to represent the learning curve 
but as long as the simple equation 20 with its various trans- 
formations fits the data, and as long as we do not have the basis 
for a rational equation for learning I have been content to 
abide by it 



TYPEWRITER LEARNING 

i) The Subjects 

Eighty-three students at the Duff Business School in Pitts- 
burgh took one four minute typewriter test once a week during 
the school year 191 6-17. The tests were begun in September 
and continued until the middle of April. The subjects practiced 
two hours of school schedule time every day, five days a week. 
No tests are available for the first three weeks of practice be- 
cause teachers of typewriting who use the touch system prefer 
not to give tests from straight copy until the mechanism of the 
typewriter and the key board have been mastered. This takes 
from three to seven weeks, depending on the maturity, adapta- 
bility and industry of the students. Practically all of the sub- 
jects had finished the grammar school, a number of them had 
completed one or two years of high school, and several had fin- 
ished a four year high school course. Their average age was 
about seventeen years. In order to obtain an initial typewriting 
score I asked ten of my students who had never touched a type- 
writer to take a four-minute test. The average score for this 
group was 27 words in four minutes and this is used with the 
other data as an average initial score in typewriting. 

Of the eighty-three subjects who took the tests thirty-two 
were eliminated, leaving fifty-one subjects for the major study. 
The causes of elimination are indicated in the following table : 

Original size of group 83 

Irregular attendance 20 

Unusually irregular performance 3 
Apparent linearity of learning 

curve S 

Delayed positive acceleration 2 

Demonstrable plateau 2 

Total eliminated 32 

Size of group for major study. . . 51 



THE LEARNING CURVE EQUATION 27 

The twenty subjects eliminated from the major study on ac- 
count of irregular attendance are not of interest in this connec- 
tion. Three subjects were eliminated for extremely erratic 
performance in the tests. It is impossible that their real type- 
writing ability is even approximately represented by their erratic 
scores. The cause for their variability is undoubtedly due to 
lack of consistent interest in their work and in the tests. Most 
of the subjects took a competitive attitude toward the tests, the 
results of which were given them weekly by their instructor. 

Ten subjects had learning curves which deviated from the 
typical hyperbolic form which we are here considering. This 
is a limitation of our method which is only applicable to the 
hyperbolic form of the speed-amount curve. Of these ten sub- 
jects five were eliminated from the major study on account of 
apparent linearity of the learning curves. No learning curve 
can ever be continuously linear if it is plotted in the speed* 
amount form. If it were linear the subject would have no 
physiological limit and he would in time reach the rather en- 
viable attainment of infinite writing speed, which is of course 
absurd. Another alternative with a linear learning curve is that 
it is linear until it reaches the practice limit after which it re- 
mains at the limit. I can not entertain this as a possibility for 
it is inconceivable that an organic function like learning pro- 
ceeds according to a linear relation until it bumps into some 
inflexible practice limit at which it stops and remains. The only 
possible explanation of apparently linear learning curves that 
I am willing to entertain is that they are in reality curved but 
that the degree of curvature is so small that it is concealed by 
the variability, of the individual observations. Such learning 
curves are therefore indeterminate unless they be continued far 
enough to make the curvature appear in spite of the variations 
of the individual observations. This leads to the conclusion 
that the accuracy with which the learning coefficients can be 
determined is contingent on two principal factors. It varies 

a) with the degree of curvature of the learning curve, and 

b) inversely as the variability of the individual measurements. 
The coefficients of a learning curve with minimum variability 
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may be determined with a minimum amount of visible curvature. 
The more variable the measurements the greater is the degree 
of curvature necessary for a fairly accurate determination of 
th learning curve coefficients. Whether the linearity of the 
curves of these five subjects is apparent or real can not be settled 
with the available data. If the linearity is real it constitutes a 
limitation in the use of the learning curve equation. 

Two subjects were eliminated on account of delayed positive 
acceleration. Their learning curves constitute deviations from 
the usual shape of curve and can not be handled by the methods 
which we are discussing here. It is not certain that these meas- 
ures are not simply cases of erratic performance. 

Two out of the eighty-three subjects showed clear evidence 
of a plateau. Whether this is psychologically significant or 
simply due to the fact that these subjects were offered positions 
after attaining a specified typewriter proficiency is indetermina- 
ble. The higher order learning curve which followed the first 
curve is not carried far enough with either of these two sub- 
jects to justify determining the learning coefficients for the first 
and second order curves. 

We have eliminated twelve out of sixty-three complete rec- 
ords. Generalizing from this fact we may conclude that the 
speed-amount form of learning for typewriting takes the hyper- 
bolic form in about four cases out of five. This justifies our 
reference to it as the typical but not as the universal form of 
learning curve. 

2) The Coordinates of Curves for Typewriting 

My first intention was to plot the learning curves with speed 
as ordinates and time in weeks as abscissae, the speed-time 
form. Finding that the industry of the subjects during the 
practice hours varied immensely I decided that the psychological 
analysis would be more equitable if I measured practice in terms 
of total number of pages written rather than in terms of time, 
although time is statistically more readily obtained than the 
amount of practice. The practice sheets were all turned in to 
the teacher in charge who tabulated every week the number of 
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pages written by each subject. According to the typewriter 
championship rules, attainment should be scored by deducing 
five words from the speed total for every error. For the pur- 
pose of psychological study I have separated errors from speed. 
The learning curves are all plotted as speed (words in four min- 
utes, disregarding errors) against formal practice (pages writ- 
ten since entering the course). While errors are entirely dis- 
regarded in these curves the subjects were of course not informed 
on this point. The errors are studied separately by correlating 
them with the other learning characteristics. In this manner 
we shall arrive at a statement of the relationship between the 
several learning characteristics without artificially loading them 
with each other, as would be the case if we penalized the score 
for speed by the number of errors. 

3) The Learning Coefficients for Typewriting 

We shall use the following notation in studying typewriter 
learning. 

X = Practice, in terms of the total number of pages written 

since entering the course. 
Y = Attainment, in terms of the number of words written 

in four minutes. 
x == Number of pages written when an individual test is 

taken. 
y = Number of words written in a four minute test. It is 

the observed speed whereas Y is the speed indicated by 

the learning curve equation, 
n = Number of tests taken. 
y a = Average speed in all tests or 

2y 

y* = — 

n 

y 20 = Average speed after twenty weeks of practice, and sim- 
ilar notation for the average speed at other stages of 
learning. 

L = Predicted practice limit in terms of words written in 
four minutes. 

R = The rate of learning, a constant which is numerically 
large for a low rate of approach and numerically low 
for a rapid rate of approach. 

P = Equivalent previous practice in terms of practice units 
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(pages written). It is the negative x-intercept of the 
speed-amount curve. 

d = Absolute deviation which expresses the deviation of any 
single observation above or below the value indicated 
by the learning curve at that stage of learning. It is 
positive when the speed of any single test is above that 
indicated by the learning curve, and negative when the 
actual speed is below the learning curve. It can also 
be represented by the relation 
d = y — Y 

D = Average deviation for all tests during the year, or 

2d 

D = 

n 

d r = Relative deviation of an individual test, determined by 

the ratio 

_ d _ y — Y 

r ~ Y" Y~ ' 

D r = Average relative deviation, determined by the ratio 

n 
V = Coefficient of variability, determined by the ratio 

D 

V = * 

e = Number of errors made in an individual test as deter- 
mined by the Typewriter Championship rules. 
E = Average number of errors in all tests, or 

2(e) 
E= V J 
n 

e r = Relative inaccuracy of an individual test, or 

e 
e r = — 

y 

E r = Average relative inaccuracy for all tests, or 

n 
A = Coefficient of inaccuracy, determined by the ratio 

E 
A = 
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4) Findings 

a) Writing Speed 

Figure 5 indicates the relation between average speed and 
number of pages practiced. The law of diminishing returns is 
shown by the continuity of the points representing average speed 
but we can not assert the universality of this form because we 
have already eliminated 12 out of the 63 available records. It 
will serve well as a norm of average performance for groups 
comparable with the one here represented. It is interesting to 
note that the curve does not pass through the origin. This is 
explainable by the fact that a person who has never written on 
a typewriter can, nevertheless, even on the first trial, make some 
finite score. I could not readily obtain a test from the fifty-one 
subjects prior to formal instruction on the typewriter. In order 
to ascertain what a truly initial score is, I asked ten of my stu- 
dents who had never touched a typewriter to take a four minute 
test. The mean as well as the average of these ten subjects was 
27 words in four minutes. This is the point to which the compo- 
site learning curve in figure 5 projects. 

The solid line of figure 5 represents the hyperbolic curve form. 

It is a good fit on the data which are represented by the small 

circles. The equation of the composite curve is 

2i6.(X+i9.) 

Y = 40 

X+148. 

in which Y is the average score for the 51 subjects and X is the 
number of pages of practice. The predicted limit of practice 
for speed is 216. words in four minutes, which agrees well with 
average typewriter speed. The average of the limits of practice 
as determined from the individual curves is 214. The equivalent 
previous practice for the composite curve is 19. This indicates 
that the general experience which these subjects brought to their 
first practice on the typewriter was equivalent to nineteen pages 
of formal practice. The rate of learning, R, for the composite 
curve is 129, a constant which varies inversely with the relative 
rapidity with which the limit of practice is approached. The 
average of the rates of approach as determined from the indi- 
vidual curves is 137. The composite curve, figure 13, is plotted 
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against time in weeks instead of against amount of practice. It 
has an entirely different appearance. It shows the initial posi- 
tive acceleration previously discussed. 

We shall now turn to the individual records and ascertain by 
correlation methods the interrelations of the learning character- 
istics for typewriting. 

The correlation between the predicted limit of practice and 
the average speed of writing for all tests during the year is 
0.68. This correlation could not even theoretically be close to 
unity for while the fast writers tend to approach the higher 
practice limits there is considerable variation in the rate at which 
the limit is approached. It should be noted that the predicted 
limit was not attained by these subjects. The limit is predicted 
on the basis of the curve shape. If the predicted limit used here 
agrees at all closely with the ultimate writing speed of these sub- 
jects, the correlation of 0.68 between the practice limit and the 
average writing speed during eight months' instruction indicates 
that the latter measure is not a very reliable criterion of ultimate 
proficiency. I think that it constitutes another piece of evidence 
against hasty and self-confident predictions based on so called 
vocational mental tests. 

The correlation between the predicted limit of practice, L, 
and the rate of learning, R, is 0.75. This indicates that those 
who approach a high limit of practice in speed generally ap- 
proach their limit at a lower rate than those who are approach- 
ing a low speed as their limit. The regression is linear and hence 
the converse is true, namely that those who approach a low limit 
of practice generally approach their limit at a relatively higher 
rate than those who have a high limit of practice. 

The correlation between predicted limit of writing speed and 
average number of errors in unit time is 0.10. Hence we con- 
clude there is no discernible relation between the normal writing 
speed and the absolute number of errors made per unit time. 
But there is a noticeable relation between the coefficient of in- 
accuracy, A, and the practice limit, L. The coefficient of corre- 
lation is only — 0.16 but the regression is non-linear. Those 
who approach a low practice limit for writing speed tend to be 
inaccurate, but those who approach a high practice limit are 
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either accurate or inaccurate. On the other hand those who are 
unusually accurate tend to be fast writers whereas those who 
are inaccurate are either fast or slow writers. The number of 
subjects is not large enough to warrant the calculation of the 
eta coefficient. 

The correlation between the predicted writing speed and the 
speed after eight weeks of practice is 0.27. Making use of the 
actual data instead of predicted performance, we find that the 
correlation between the speed after eight weeks practice and 
that after twenty weeks practice is 0.74. The correlation be- 
tween predicted limit and the average writing speed for all tests 
is 0.68. 

In this connection I wish to suggest what will perhaps be a 
more reliable technique in psychological prognosis. If a high 
degree of relationship can be established between the learning 
curve constants for a complex function, performance in which 
is to be predicted, and the corresponding constants for a simple 
learning function which can be completed during a single sitting, 
then the constants for the simple learning test would have diag- 
nostic value in predicting performance in the complex function. 
Such coefficients would not be subject to the accidents of a first 
performance but would represent the organically more significant 
learning function as such. The diagnostic value of such a tech- 
nique is largely dependent on the degree of difficulty of the 
material to be learned. 
b) The errors 

The relation between the average absolute number of errors 
made in each test and the number of weeks of practice is indi- 
cated in figure 16. This shows that the number of errors in unit 
time increases with practice. The relation between these two 
attributes may be expressed by the empirical equation 

e = o.i2T + 2.i 66 

in which e is the average number of errors in a four minute 
test, and T is the number of weeks of instruction. The equation 
expresses a norm of average performance. 

The analogous relation between average absolute number of 
errors and writing speed during the year is shown in figure 17. 
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This also indicates that the number of errors made in unit time 
increases with the attainment of writing speed. The relation 
may be expressed by the empirical equation 

e = o.023y + i.5 67 

which is fairly representative within the limits of observation. 
However, the relative inaccuracy decreases with practice as in- 
dicated in figure 18, i.e., the number of errors per page decreases 
with practice but the number of errors per unit time increases 
with practice. There is no discernible relation between the rela- 
tive inaccuracy (errors per unit time) and the rate of learning. 
c) The variability 

Those who have a high practice limit for writing speed usually 
have a larger average deviation from their learning curves than 
those who approach a low practice limit. This statement must 
be considered in connection with relation between predicted limit 
of writing speed and the average relative deviation. The corre- 
lation between predicted writing speed and average relative de- 
viation for all tests is 0.27, indicating that the fast writers have 
a slight tendency to be more erratic in speed than the slow writers, 
even though the measure of variability is taken as the ratio of 
deviation to writing speed. According to this measure the writer 
of 60 words per minute is allowed a deviation from his repre- 
sentative learning curve twice that of a writer of 30 words per 
minute. But even according to this relative standard of varia- 
bility the fast writers tend to be slightly more erratic in speed. 

Figure 14 indicates that the deviations from the learning curve 
increase with practice but figure 1 5 shows that the ratio of devia- 
tion to theoretical writing speed, as indicated by the curve, de- 
creases with practice. It is apparent that, just as one would 
expect, the absolute deviations increase with practice but the 
relative deviations decrease with practice. The decrease in the 
relative deviation with practice does not become noticeable until 
after about three months but after that the relation is approxi- 
mately linear with practice time. In other words, the variability 
of the writing speed for any individual subject tends to decrease 
with practice, but if he is a fast writer he tends to be more 
variable in his writing speed than if he is a slow writer, even 
when the variability is measured in relative terms. 



SUMMARY 

i ) Forms of the Learning Curve 

We have discussed four different forms in whiUi most learn- 
ing data can be graphed. We have called these forms i) the 
speed-amount curve, 2) the speed-time curve, 3) the time-time 
curve, and 4) the time-amount curve. 

The speed-amount curve is plotted as speed, number of suc- 
cessful acts per unit time, or a multiple thereof, against the total 
number of formal practice acts, or some multiple of it. In plot- 
ting typewriter learning we have used words in four minutes, 
and the total number of pages written as the coordinates of the 
speed-amount curve. This form of curve is illustrated in figure 
5 which gives average writing speed for fifty-one subjects against 
total number of pages written. The small circles indicate the 
observations and the solid line indicates the general trend of the 
learning. The solid line is represented by the general equation 

L(X+P) 
Y = 20 

(X+P)+R ~ v 

in which 

L = Predicted practice limit in terms of speed units. 

X = Pages written. 

Y== Writing speed in terms of words in four minutes. 

P = Equivalent previous practice in terms of pages. 

R = Rate of learning, a constant which varies inversely as 

the relative rapidity with which the practice limit is being 

approached. 
K = P + K, a constant used for convenience. 
The particular line of figure 5 is represented by the equation 

Y ^ 2i6.(X+i 9 .) 

X+148. 
which we may interpret as follows. The predicted average prac- 
tice limit, L, for the group of fifty-one subjects is 216 words 
in four minutes or about 54 words a minute. The rate of learn- 
ing, R, is a constant which in this curve has the value of 129. 
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Its only usefulness is in comparing the rates of several learning 
curves with each other. By itself, and for a single curve, it has 
no significance. When used to compare several learning curves 
the precaution must be observed that all curves so compared be 
plotted by the same units for the coordinates. The equivalent 
previous practice is nineteen pages. This is interpreted to mean 
that the general experience which these students brought to their 
first instruction on the typewriter was equivalent to writing nine- 
teen pages on the machine. This coefficient as well as the pre- 
dicted limit is based on the assumption that the unmeasured part 
of the learning before and after the observations followed the 
hyperbolic law. This assumption seems to be fairly safe since 
other learning curves, the actual observations for which start 
with practically zero attainment and continue almost to the prac- 
tice limit, usually follow the hyperbolic form. See figure n 
which represents a substitution test learning curve carried almost 
to the practice limit, and the curve for subject No. 23 in figure 3 
for typewriter learning which projects to the origin. 

The time-amount curve is plotted as time, t, per unit amount 
of work against number of formal practice acts, X. Its equa- 
tion is 

X+K 

t = 49 

L(X+P) 

with notation similar to that of equation 20. Learning data can 
be changed from the time-amount form into the speed-amount 
form and vice versa by noting the fact that speed, Y, is propor- 
tional to the reciprocal of the time, t, per unit amount of work. 

The time-time curve is plotted as time, t, per unit amount of 
work against total practice time, T. Its equation, 54, is derived 
from equation 20. This equation is too cumbersome for prac- 
tical work and the speed-amount or time-amount curves should 
therefore be used unless one adopts a simple empirical equation 
for the time-time form. 

The speed-time curve is plotted as speed, Y, against total prac- 
tice time, T. Its equation, 55, is too unwieldy for practical work 
but it serves to demonstrate the following proposition regarding 
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<c; 



positive acceleration. If we assume that the typical form of 
::: speed-amount curve is hyperbolic, then the learning curve in 

r: the speed-time form must necessarily have an initial positive 

ec acceleration which changes to a negative acceleration when the 

,i: attainment has reached one-third of the limit of practice. The 

m positive acceleration can not, of course, be obtained when the 

i initial score is greater than one-third of the practice limit. It 

can only be observed when the initial score is less than one-third 
of the practice limit. 

The influence of different values for the learning coefficients 
ii on the shape of the learning curve may be summarized in the 

following comparisons. Figure 7 shows two learning curves 
approaching two different limits at the same rate. Figure 8 
shows two curves approaching the same limit at different rates. 
Figure 9 shows two curves approaching the same limit at the 
same rate but differing in the amount of previous practice. 
Curve A has a start of forty practice units over curve B. Fig- 
ure 10 shows two curves, one approaching a high limit at a low 
rate, the other approaching a lower limit at a high rate. The 
important feature of this comparison is that one who learns 
rapidly but with a low limit will do better in the first stages of 
the learning than one who learns slowly with a high limit. The 
45 comparison shifts later in favor of the learner with the high 

! limit. This is a condition which experimenters on learning 

1 should be on the look-out for in order to guard against the er- 

roneous comparison of two subjects from insufficient practice 
data. 

of 2) Outline for Calculating the Learning Coefficients 

ed We shall describe two methods of calculating the coefficients. 

tc- These are 1) the method of all points, and 2) the method of 

Id three points. 

1 ) Method of all points: 

1 ) Arrange the data in two columns as follows : X, the total 
number of formal practice acts since the beginning of practice, 
or a multiple of this number, and Y, the number of successful 
acts in unit time, or a multiple of this number. The multiple 
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used for the Y-column need not of course be the same as that 
for the X-column. 

2) Draw a chart analogous to figure 5. Leave room for a 
negative x-intercept. Always include the zero point of the 
y-scale on the chart. 

3 ) Select a representative initial score. In figure 5 the actually 
observed initial score, 27, was used. Denote this by the sym- 
bol Y x . 

4) Compute the values of X/(Y— Y t ) and XY/(Y— Y t ) 
for each observation. Arrange these in two columns. 

5) Draw a chart analogous to figure 6 with the coordinates 
determined in step 4. If the data so plotted fall nearly on a 
straight line the speed-amount curve is hyperbolic. If it does 
not, the use of the learning curve equation is not justified and 
other methods must be resorted to. 

6) Fit a straight line through these points in figure 6 in such 
a manner that there are about as many points on one side of 
the line as there are points on the other side. This procedure 
is called "rectifying the equation." The line can be fitted more 
accurately by the method of least squares but since that is a 
rather laborious procedure it should be avoided unless the points 
are so badly scattered that they can not readily be fitted by in- 
spection. Even then it is doubtful whether one is justified in 
applying the equation to learning data so erratic that a straight 
line can not be fitted by inspection. 

7) Continue this line until it intersects the x-axis. The x- 
intercept gives the value of IC/Y^ and since the value of Y x is 
already known the value of K can be readily determined. 

8) The slope of the line is numerically equal to the predicted 
limit, L. 

9) The constant P may then be determined from the equation 

K-Yi 

P = - 

L 

10) The constant R is then determined by the equation 

R = K — P 
since K and P are known. 
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2) Method of three equidistant points. 

The first two steps of this method are identical with the first 
two steps in the method of all points. 

3) Draw a smooth curve through the observations. If the 
data show irregularities in the rate of learning draw the smooth 
curve so that it has approximately as many observations above 
the line as there are observations below the line. A ragged line 
through all the more or less erratic observations will not serve 
the purpose. If the smooth curve representative of the data is 
not of the hyperbolic form the method of this learning curve 
equation is not applicable. 

4) Select the three following points: 

in which X x is zero, and Y ± is the ordinate to the smooth curve 
at this value of X. If the learning data have no irregularities 
the value of Y x will be identical with the initial score. 

X 2 ; Y 2 
in which X 2 is one half of the total amount of formal practice 
and Y 2 is the representative ordinate to the curve for this value 
of X. 

X 8 ; Y 8 
in which X 3 is the total amount of formal practice and Y 3 is the 
ordinate to the smooth curve at this value of X. If the learning 
data show no irregularities, this will be identical with the final 
score. 

5) Determine the numerical value of the constant K from 
the equation: 

X»(Y 2 -Y 8 ) 

Ys+Y!— 2Y 2 

6) Determine the numerical value of constant L by means 
of the following equation: 

L = Y 8 (X 8 +K)-Y 1 .K 

X 8 

7) Determine the numerical value of the constant P by the 

following equation: 

Y r K 
P = — - — 
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8) Determine the numerical value of the constant R by the 
following equation : 

R = K— P 

After the constants of the learning curve equation have been 
numerically evaluated it is best to check the arithmetical work 
by computing the theoretical value of the attainment for one 
or two points according to the following formula. These the- 
oretical values of attainment should not differ much from the 
actually observed values unless the learning has been very erratic. 

y= L(X+P) 

(X+P)+R 
A gross measure of the variability of the learning may be 
determined from the equation 

D 

V = 

in which D is the average deviation from the theoretical curve 
for all the observations, and y a is the average attainment as 
determined from all observations. 
3) Typewriter Learning 

The following relations were found to be significant with re- 
gard to typewriter learning. 

The correlation between the predicted practice limit and the 
average writing speed for all tests which covered about seven 
months is +0.68. The correlation between practice limit and 
rate of learning is +0.75. This indicates that those who ap- 
proach a high practice limit usually do so at a lower rate than 
those who approach a low limit since the coefficient, R, for the 
rate of learning varies inversely with the rate of learning. There 
is a noticeable relation between accuracy and the predicted prac- 
tice limit. Those who approach a low practice limit for writing 
speed are usually inaccurate, but those who approach a high 
practice limit are either accurate or inaccurate. On the other 
hand those who are inaccurate are either fast or slow writers. 
The number of subjects, 51, is not large enough to warrant the 
calculation of the eta-coefficient. The correlation between the 
predicted practice limit and speed as determined in the test at 
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the 8th week is +0.27. There seems to be no relation between 
the predicted writing speed at the limit of practice and the num- 
ber of errors made in unit time. 

The number of errors made in unit writing time increases 
with practice. Similarly there is a positive relation between the 
number of errors in unit time and writing speed during practice. 
However, the number of errors per unit amount of work de- 
creases with practice. See figures 16, 17, and 18. There is no 
discernible relation between the relative accuracy and the rate 
of learning. 

Those who have a high practice limit for writing speed usually 
have larger relative deviations from their theoretical learning 
curves. The variability of the writing speed for any individual 
subject tends to decrease with practice, but if the student is a 
fast writer he tends to be more variable in writing speed, than 
if he is a slow writer, even when the variability is measured in 
relative terms. According to this standard of variability the 
writer of 60 words per minute is allowed a deviation from his 
representative learning curve twice that of a writer of 30 words 
per minute. But even according to this relative standard of 
variability the fast writers tend to be slightly more erratic in 
speed. 

Considerable ambiguity in discussions about learning curves 
has been caused by the comparison of learning curves with dif- 
ferent units for the coordinates. Thus we are entirely safe in 
saying that the speed-amount curve is never continuously linear. 
It would lead to infinite speed of performance which is of course 
absurd. But while that statement is obviously true it does not 
entitle us to jump to the denial of say linear error-time curves. 
It is quite possible for errors plotted against time to be linear. 
Therefore we should always specify the coordinates for the 
curves we are discussing. 

While I have confined myself throughout to what I have called 
the typical hyperbolic form of the speed-amount curve it is quite 
essential to keep in mind that this form of curve is not universal 
and that consequently it is impossible to make sweeping gen- 
eralizations except in so far as we explicitly limit ourselves to 
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the relations which follow from the assumed hyperbolic form 
with which we started. 

The preceding pages have been filled with so much algebraic 
manipulation that the reader who has long since dropped the 
algebraic thinking of his school days may find their very appear- 
ance formidable and distasteful. For the benefit of those who 
have acquired an aversion against symbolic notation I wish to 
call attention to the outline for calculating the coefficients and 
the section on learning curve forms in the summary. In those 
sections will be found all that is really essential in applying the 
method. If the use of empirical equations in the quantitative 
study of the multifarious aspects of memory is at all furthered 
by the present study I shall be content though the particular 
forms used here are superseded by others. 
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